1 . (currently amended) A method for generating summaries of a video 
comprising the steps of: 

inputting summary sentences, visual information and a section-begin 
frame and a section-end frame for each story in a video; 
selecting a type of presentation; 
locating a set of images available for each story; 

auditing the summary sentences to generate a plurality of summary audio 
segments corresponding to an auditory narration of each story; 

composing the set of images to selectively match the set of images with 
the summary sentences to generate a plurality of summary image segments: 

matching said aud i ted summary Gontonoos audio segments with the sot of 
summary images segments to generate a story summary video for each story in 
the video; and 

combining each of the generated story summaries to generate a summary 
of the video. 

2. (original) The method of claim 1, wherein the visual information 
comprises at least one of a shotlist, a keyframelist and a combination thereof. 

3. (original) The method of claim 1 , wherein the summary sentences 
are generated by: 

generating story boundaries and sentence data using a story extractor; 
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selecting a length of a story summary; 

summarizing said sentence data to produce at least one summary 
sentence, wherein a number of the summary sentences produced corresponds to 
the length of the story summary; and 

ordering the at least one summary sentence based on its selection order. 

4. (original) The method of claim 1 , wherein the type of presentation 
comprises an image slide format. 

5. (original) The method of claim 1, wherein the type of presentation 
comprises a poster format. 

6. (original) The method of claim 1 , wherein the section-begin frame 
and the section-end frame determines a story boundary. 

7. (original) The method of claim 1, wherein the step of locating the 
set of images further comprises the steps of: 

collecting a list of images within a story boundary; 

generating a mergelist for clustering images corresponding to each shot 
into visually similar groups; 

deleting images belonging to a largest visually 
similar group; and 
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sampling a remaining list of images to produce the set of images. 

8. (original) The method of claim 7, wherein the sampling is performed 
uniformly with a sampling interval determined by a number of images desired for 
a given length of story summary. 

9. (original) The method of claim 7, wherein the step of sampling 
further comprises selecting a frame number of each proper noun. 

10. (currently amended) A method for generating summaries of a 
video, comprising the steps of: 

inputting story summary sentences, video information and speaker 
segments for each story in a video; 

locating video clips for each story from said video infomnation; 

capturing audio clips from the video clips, said audio clips corresponding 
to the summary sentences; 

composing the video clips to selectively match the video clips with the 
summary sentences to create a plurality of composed video clips: 

combining said corresponding audio clips with the composed video clips to 
generate a story summary video for each story in the video; and 

combining each of the generated story summaries to generate a summary 
of the video. 
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1 1 . (original) The method of claim 10, wherein the summary sentences 
are generated by: 

generating story boundaries and sentence data using a story extractor; 
selecting a length of a story summary; 

summarizing said sentence data to produce at least one summary 
sentence, wherein a number of the summary sentences produced corresponds to 
the length of the story summary; and 

ordering the at least one summary sentence based on its selection order. 

12, (currently amended) A program storage device readable by a 
machine, tangibly embodying a program of instructions executable by the 
machine to perform method steps for generating summaries of a video, the 
method steps comprising the steps of: 

providing summary sentences, visual information and a section-begin 
frame and a section-end frame for each story in a video; 
selecting a type of presentation; 
locating a set of images available for each story; 

auditing the summary sentences to generate a plurality of summary audio 
segments corresponding to an auditory narration of each story; 

composing the set of images to selectively match the set of images with 
the summary sentences to generate a plurality of summary image segments; and 

5 



matching said aud i t e d summary audio segments sontoncos with the s e t of 
summary images segments to generate a story summary video for each story in 
the video, wherein a summary of the video is generated by combining each of the 
generated story summaries. 

13. (original) The program storage device of claim 12, wherein the 
visual information comprises at least one of a shotlist, a keyframelist and a 
combination thereof. 

14. (original) The program storage device of claim 12, wherein the 
instructions for generating summary sentences comprise instructions for 
performing the steps of: 

generating story boundaries and sentence data using a story extractor; 
selecting a length of a story summary; 

summarizing said sentence data to produce at least one summary 
sentence, wherein a number of the summary sentences produced corresponds to 
the length of the story summary; and 

ordering the at least one summary sentence based on its selection order. 

15. (original) The program storage device of claim 12, wherein the type 
of presentation comprises an image slide format. 
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16. (original) The program storage device of claim 12, wherein the type 
of presentation comprises a poster format. 

17. (original) The program storage device of claim 12, wherein the 
section-begin frame and the section-end frame determines a story boundary. 

18. (original) The program storage device of claim 12, wherein the step 
of locating the set of images further comprises the steps of: 

collecting a list of images within a story boundary; 

generating a mergelist for clustering images corresponding to each shot 
into visually similar groups; 

deleting images belonging to a largest visually 
similar group; and 

sampling a remaining list of images to produce the set of images. 

19. (original) The program storage device of claim 18, wherein the 
sampling is perfomried uniformly with a sampling interval determined by a number 
of images desired for a given length of story summary. 

20. (original) The program storage device of claim 18, wherein the step 
of sampling further comprises selecting a frame number of each proper noun. 
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